Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making
نویسندگان
چکیده
It is often argued that an agent making decisions on behalf of two or more principals who have different utility functions should adopt a {\em Pareto-optimal} policy, i.e., a policy that cannot be improved upon for one agent without making sacrifices for another. A famous theorem of Harsanyi shows that, when the principals have a common prior on the outcome distributions of all policies, a Pareto-optimal policy for the agent is one that maximizes a fixed, weighted linear combination of the principals' utilities. In this paper, we show that Harsanyi's theorem does not hold for principals with different priors, and derive a more precise generalization which does hold, which constitutes our main result. In this more general case, the relative weight given to each principal's utility should evolve over time according to how well the agent's observations conform with that principal's prior. The result has implications for the design of contracts, treaties, joint ventures, and robots.
منابع مشابه
Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making
Existing multi-objective reinforcement learning (MORL) algorithms do not account for objectives that arise from players with differing beliefs. Concretely, consider two players with different beliefs and utility functions who may cooperate to build a machine that takes actions on their behalf. A representation is needed for how much the machine’s policy will prioritize each player’s interests o...
متن کاملData Clustering of Solutions for Multiple Objective System Reliability Optimization Problems
This paper proposes a practical methodology for the solution of multi-objective system reliability optimization problems. The new method is based on the sequential combination of multi-objective evolutionary algorithms and data clustering on the prospective solutions to yield a smaller, more manageable sets of prospective solutions. Existing methods for multiple objective problems involve eithe...
متن کاملInputs and Outputs Estimation in Inverse DEA
The present study addresses the following question: if among a group of decision making units, the decision maker is required to increase inputs and outputs to a particular unit in which the DMU, with respect to other DMUs, maintains or improves its current efficiencylevel, how much should the inputs and outputs of the DMU increase? This question is considered as a problem of inverse data envel...
متن کاملOn the Optimal Solution Definition for Many-criteria Optimization Problems
When dealing with many-criteria decision making and many-objectives optimization problems the concepts of Pareto optimality and Pareto-dominance are inefficient in modelling and simulating human decision making. Different fuzzy-based definitions of optimality and dominated solution are introduced and tested on analytical test cases in order to show their validity and closeness to human decision...
متن کاملMulti-objective Solution Approaches for Employee Shift Scheduling Problems in Service Sectors (RESEARCH NOTE)
Today, workforce scheduling programs are being implemented in many production and service centers. These sectors can provide better quality products and/or services to their customers, taking into account employees’ desires and preferences in order to increase sector productivity. In this study, an employee shift scheduling problem in the service sector is discussed. In the problem, the aim is ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1711.00363 شماره
صفحات -
تاریخ انتشار 2017